Probabilistic Latent Tensor Factorization for 3-way Microarray Data Analysis with Missing Values
نویسندگان
چکیده
The recent advances in microarray technology enabled the measurement of gene expression levels of samples over a series of time points. Unlike the traditional 2D microarray data, such experiments generate 3D (gene-sample-time) microarray data, which require specialized methods for analysis. In this study, we propose a novel tensor factorization model for modeling 3D microarray data. The model assumes the existence of certain temporal patterns that are repeated over time. One main advantage of the model is that it handles the missing data implicitly, so that the estimation process is not effected by the existence of missing values, which commonly occur in microarray data. We evaluate our model on classification of the good or bad responders to Interferon beta (INFβ) treatments by using a real gene-sample-time microarray data set and achieve a promising prediction performance.
منابع مشابه
Variational Inference For Probabilistic Latent Tensor Factorization with KL Divergence
Probabilistic Latent Tensor Factorization (PLTF) is a recently proposed probabilistic framework for modelling multi-way data. Not only the common tensor factorization models but also any arbitrary tensor factorization structure can be realized by the PLTF framework. This paper presents full Bayesian inference via variational Bayes that facilitates more powerful modelling and allows more sophist...
متن کاملProbabilistic Tensor Factorization for Tensor Completion
Multi-way tensor datasets emerge naturally in a variety of domains, such as recommendation systems, bioinformatics, and retail data analysis. The data in these domains usually contains a large number of missing entries. Therefore, many applications in those domains aim at missing value prediction, which boils down to a tensor completion problem. While tensor factorization algorithms can be a po...
متن کاملScalable Probabilistic Tensor Factorization for Binary and Count Data
Tensor factorization methods provide a useful way to extract latent factors from complex multirelational data, and also for predicting missing data. Developing tensor factorization methods for massive tensors, especially when the data are binaryor count-valued (which is true of most real-world tensors), however, remains a challenge. We develop a scalable probabilistic tensor factorization frame...
متن کاملLiver CT Annotation via Generalized Coupled Tensor Factorization
This study deals with the missing answers prediction problem. We address this problem using coupled analysis of ImageCLEF2014 dataset by representing it as a heterogeneous data, i.e., dataset in the form of matrices. We propose to use an approach based on probabilistic interpretation of tensor factorization models, i.e., Generalized Coupled Tensor Factorization, which can simultaneously fit a l...
متن کاملMulti-HDP: A Non Parametric Bayesian Model for Tensor Factorization
Matrix factorization algorithms are frequently used in the machine learning community to find low dimensional representations of data. We introduce a novel generative Bayesian probabilistic model for unsupervised matrix and tensor factorization. The model consists of several interacting LDA models, one for each modality. We describe an efficient collapsed Gibbs sampler for inference. We also de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012